Corpus: mrj_wikipedia_2016_10K

2.2.5 Most frequent word beginnings

The most frequent word beginnings, given as character N-grams for N = 1 to 5, together with a Zipf diagram.
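As a rough illustration of how such lists can be produced, the sketch below counts word-initial character N-grams for N = 1 to 5. It is not the tool used for this page. The function names `prefix_counts` and `top_prefixes` are hypothetical, the input is assumed to be an already tokenized word list, and words shorter than N characters are assumed to be skipped for that N; the page itself does not state any of these rules.

```python
from collections import Counter


def prefix_counts(tokens, max_n=5):
    """Count word-initial character n-grams for n = 1..max_n.

    Assumption: a token contributes to the n-gram table only if it has
    at least n characters. No case folding is applied here.
    """
    counts = {n: Counter() for n in range(1, max_n + 1)}
    for token in tokens:
        for n in range(1, max_n + 1):
            if len(token) >= n:
                counts[n][token[:n]] += 1
    return counts


def top_prefixes(counts, n, k=5):
    """Return the k most frequent n-gram beginnings as (rank, frequency, n-gram).

    A trailing hyphen is appended to mirror the notation of the tables.
    """
    return [
        (rank, freq, gram + "-")
        for rank, (gram, freq) in enumerate(counts[n].most_common(k), start=1)
    ]


if __name__ == "__main__":
    # Toy token list, made up for illustration only (not corpus data).
    toy_tokens = ["колхоз", "конец", "кол", "про"]
    toy_counts = prefix_counts(toy_tokens)
    for n in range(1, 6):
        print(n, top_prefixes(toy_counts, n))
```

Keeping one `Counter` per N makes each table an independent ranking, which matches the way the lists are presented per N-gram length.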


[Figure: Zipf's diagram for word beginnings (Gnuplot diagram)]
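A Zipf diagram plots frequency against rank on logarithmic axes. The following sketch shows one way to derive the plotted points and a least-squares slope from a list of frequencies. The helper names `zipf_points` and `zipf_slope` are hypothetical, and the fit is only an illustration: the five values used in the example are the top single-character frequencies from this section, far too few for a meaningful estimate.

```python
import math


def zipf_points(frequencies):
    """Return (log10(rank), log10(frequency)) pairs, ranked by descending frequency."""
    ordered = sorted((f for f in frequencies if f > 0), reverse=True)
    return [
        (math.log10(rank), math.log10(freq))
        for rank, freq in enumerate(ordered, start=1)
    ]


def zipf_slope(points):
    """Least-squares slope of the log-log points (needs at least two points)."""
    n = len(points)
    mean_x = sum(x for x, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    sxx = sum((x - mean_x) ** 2 for x, _ in points)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in points)
    return sxy / sxx


if __name__ == "__main__":
    # Frequencies of the five most frequent single-character beginnings.
    top_character_freqs = [2091, 1875, 1254, 1144, 1126]
    points = zipf_points(top_character_freqs)
    print(points)
    print("slope:", zipf_slope(points))
```

A slope close to -1 would correspond to the classic Zipf relation, where frequency is inversely proportional to rank.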

Top Characters
rank  frequency  n-gram
1     2091       к-
2     1875       п-
3     1254       с-
4     1144       т-
5     1126       1-

Top Character Bigrams
rank  frequency  n-gram
1     462        ко-
2     342        по-
3     306        ка-
4     297        пр-
5     296        19-

Top Character Trigrams
rank  frequency  n-gram
1     138        про-
2     119        сир-
3     99         кол-
4     87         кон-
5     86         тӹн-

Top Character 4-Grams
rank  frequency  n-gram
1     74         тӹнг-
2     62         йӹлм-
3     62         пӓшӓ-
4     61         лыды-
5     53         сирӹ-

Top Character 5-Grams
rank  frequency  n-gram
1     55         йӹлмӹ-
2     50         лыдыш-
3     48         тӹнгӓ-
4     34         культ-
5     34         вашта-

Generated in 523 msec on 2018-01-05 07:10.